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ABSTRACT 

This  study  examines  enlisted  attrition  behavior  for  the 
U.S.  Coast  Guard  and  develops  a  model  that  projects  attrition 
figures.  Survival  analysis  techniques  are  used  to  analyze  the 
empirical  attrition  behaviors  associated  with  an  individuals 
sex,  race,  marital  status,  and  military  occupational  skill 
(MOS) .  In  this  study  males  tend  to  have  higher  survival 
probabilities  than  females,  non- Caucasians  higher  than 
Caucasians,  and  married  persons  higher  than  those  not  married. 
Aviation  MOSs  have  the  highest  survival  probabilities  and 
technical  MOSs  have  the  lowest. 

Modelled  survivor  functions  are  developed  for  two 
paygrades  because  each  contain  small  personnel  inventories. 
These  modelled  survivor  functions  do  not  fit  the  data  as  well 
as  desired  but  are  nonetheless  used  pending  the  development  of 
sharper  alternatives. 

Finally  a  counting  model  based  on  the  Binomial 
Distribution  is  developed  that  projects  monthly  enlisted 
attrition  figures. 
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I .   INTRODUCTION 
A.   PROBLEM  STATEMENT 

The  United  States  Coast  Guard  (CG)  is  a  military  service 
operating  within  the  Department  of  Transportation.   Its  main 
missions  are  aids  to  navigation,  boating  safety,  defense 
operations,  maritime  law  enforcement,  environmental  response, 
marine  inspection,  ice  operations,  marine  licensing,  marine 
science,  search  and  rescue,  port  security  and  safety,  and 
waterways  management.   The  CG  maintains  an  active  duty 
hierarchical  personnel  structure  with  approximately  6,000 
officers  and  30,000  enlisted  members.    The  CG  Personnel 
Workforce  Planning  (PWP)  office  located  in  Washington,  D.C. 
has  the  overall  responsibility  for  projecting  the  active  duty 
enlisted  personnel  needs  of  the  CG.   Their  responsibilities 
include  the  calculating  of  personnel   stocks,   promotion 
requirements,  attritions,  and  recruitment  needs.  The  current 
method  for  predicting  monthly  enlisted  attrition  by  paygrade 
is  to  calculate  a  running  average  or  mean  of  past  monthly 
attrition  and  use  this  mean  as  a  forecast  of  future  attrition. 
This  forecasting  method  is  crude,  ignores  many  issues,  and  is 
not  accurate  enough  to  meet  the  needs  of  the  CG. 


Why  does  a  person  separate  from  the  CG?  What  are  the 
characteristics  associated  with  this  person?  Are  these 
characteristics  the  same  for  each  paygrade?  Previous  CG 
studies  have  focussed  on  the  first  question  but  have  not 
looked  at  attrition  data  analytically.  The  first  question 
will  not  be  addressed.  This  thesis  will  look  at  the  next  two 
questions  and  will  propose  a  method,  based  on  a  survival 
analysis  of  CG  enlisted  attrition  data,  for  calculating 
monthly  personnel  attrition.  This  study  is  a  first  attempt  at 
examining  CG  enlisted  attrition  using  data  analysis  tools.  It 
is  designed  to  provide  a  procedure  for  studying  attrition,  and 
for  developing  a  basic  model  that  calculates  monthly  attrition 
which  can  be  expanded  by  future  studies . 

B.   COAST  GUARD  ENLISTED  PERSONNEL  BACKGROUND  INFORMATION 

The  Coast  Guard  enlisted  personnel  structure  is  composed 
of  9  paygrades,  E-l  to  E-9,  lowest  to  highest  respectively. 
(There  is  an  E-10  position  filled  by  one  person  but  this 
thesis  does  not  address  this  paygrade.)  Paygrade  E-l  is 
composed  of  persons  attending  CG  basic  training  school  and 
paygrade  E-2  is  composed  of  persons  who  have  completed  basic 
training  school,  have  been  assigned  to  active  duty  commands, 
and  are  preparing  to  attend  a  CG  specialty  school  (also  called 
A- school) .  Paygrades  E-3  through  E-9  are  divided  into  24 
active  duty  military  occupational  skills  (MOS)  or  sub- 
specialties  listed  below: 


Aviation  Machinist's  Mate  (AD) 

Aviation  Electrician's  Mate  (AE) 

Aviation  Structural  Mechanic  (AM) 

Aviation  Survivalman  (ASM) 
Aviation  Electronic  Technician  (AT) 

Boatswain's  Mate  (BM) 

Damage  Controlman  (DC) 

Electrician's  Mate  (EM) 

Electronics  Technician  (ET) 

Fire  Control  Technician  (FT) 

Gunner's  Mate  (GM) 

Health  Services  Technician  (HS) 

Machinery  Technician  (MK) 

Musician  (MU) 

Marine  Science  Technician  (MST) 

Public  Affairs  Specialist  (PA) 

Quartermaster  (QM) 

Radarman  (RD) 

Radioman  (RM) 

Storekeeper  (SK) 

Subsistence  Specialist  (SS) 

Sonar  Technician  (ST) 

Telephone  Technician  (TT) 

Yeoman  (YN) 


An  enlisted  man  (EM)  receives  an  MOS  after  either 
completing  a  specialty  school  or  by  completing  an  intensive 
on-the-job  training  program.  Both  methods  are  usually 
completed  while  the  EM  is  in  the  E-3  paygrade.  An  E-3  who  has 
not  obtained  an  MOS  is  called  nonrated;  E-3's  through  E-9's 
hold  only  one  MOS  at  a  time  and  usually  keep  that  MOS 
throughout  their  careers. 

Promotions  to  the  next  highest  paygrade  are  performed  by 
MOS  and  are  dictated  by  the  needs  of  the  CG.  Promotions  to 
paygrades  E-5  through  E-9  are  vacancy  driven  and  promotions  to 
paygrades  E-3  and  E-4  occur  on  qualification  for  advancement. 
Promotion  to  paygrade  E-2  occurs  upon  completion  of  basic 
training. 

Enlisted  personnel  sign  service  obligation  contracts 

(enlistments)  that  require  the  person  to  serve  in  the  CG  for 
a  set  number  of  years.   Enlistments  usually  last  four  years. 

(There  have  been  two  year  enlistments  but  these  have  been 
terminated.)  Upon  completion  of  an  enlistment  and  upon 
approval  by  the  CG,   a  person  may  sign  a  new  contract 

(reenlist) ,  or  separate  from  the  CG.  Enlisted  attrition 
generally  occurs  because  of: 

retirement  -  minimum  20  years  active  duty  service, 

non-reenlistments  -  an  EM  chooses  not  to  reenlist  or  the 
CG  chooses  not  to  reenlist  the  person, 


administrative  reasons  -  an  EM  may  depart  the  CG  prior  to 
the  end  of  his/her  enlistment  "for  the  convenience  of  the 
government " , 

death  or  disability, 

selection  to  an  officer  program. 

For  the  purposes  of  this  study  the  term  separation  refers  to 
a  person  leaving  the  CG  for  any  of  the  above  reasons. 

C .    PROCEDURE 

This  thesis  examines,  by  paygrade,  individual  personnel 
characteristics  associated  with  attrition  and  develops  a  model 
that  calculates  monthly  attrition.  Survival  analysis 
techniques  are  used  to  study  behaviors  associated  with  these 
attrition  characteristics.  Survival  analysis  is  a  data 
analysis  technique  that  is  useful  in  examining  failure  time 
data.  Many  of  the  visible  applications  have  been  in  medical 
research  where  a  typical  "failure"  is  the  death  of  a  patient 
who  had  been  subjected  to  some  type  of  treatment.  Obviously 
it  would  not  be  practical  to  wait  for  all  the  patients  to 
"fail"  before  analyzing  the  data.  It  is  more  convenient  to 
end  the  experiment  at  some  specified  time  and  do  an  analysis 
of  all  the  data.  The  observables  that  did  not  "fail"  by  the 
end  of  the  data  collection  period  are  called  censored 


observations.  They  must  be  treated  differently  since  their 
failure  times  are  yet  to  occur  and  are  hence  unknown. 
Survival  analysis  allows  for  censored  failure  times  and 
provides  techniques  for  including  them  in  the  analysis.  The 
primary  tool  used  in  the  survival  analysis  will  be  the 
estimation  of  survivor  functions  or  curves.  These  curves 
provide  probabilities  that  a  person  survives  up  to  and  beyond 
a  set  time,  t.  Their  estimators  are  often  called  product 
limit  estimators  because  of  the  technique  used.  Survival 
probabilities  are  the  compliment  of  attrition  rates  and  are 
needed  to  calculate  expected  monthly  attrition  figures. 

The  personnel  data  are  cross  classified  by  paygrade  and 
each  paygrade  is  partitioned  by  certain  individual  personnel 
characteristics.  Paygrades  E-l  and  E-2  will  be  partitioned  by 
the  characteristics  of  sex,  marital  status,  and  race. 
Paygrade  E-3  will  be  partitioned  by  these  same  characteristics 
and  by  whether  the  person  had  obtained  an  MOS  or  was  nonrated. 
Paygrades  E-4  through  E-9  will  be  partitioned  by  the  same 
characteristics  as  E-ls  and  E-2s  and  by  CG  MOS.  MOSs  will  be 
grouped  according  to  categories  listed  in  Table  1  on 
page  7. 


TABLE  1.   MOS  GROUPINGS 


Group 

Rates 

Aviation 

AD,  AE, 

AM, 

ASM,  AT 

Administrative 

YN,  SK 

Technical 

EM,  ET, 

FT, 

MST,  ST,  TT 

Skilled 

HS,  SS, 

MU, 

PA 

CG  Professional 

BM,  DC, 

GM, 

MK,  QM,  RD,  RM 

MOSs  are  grouped  for  two  reasons.  MOSs  within  the  same 
group  have  similarities  that  should  cause  them  to  exhibit 
common  attrition  behavior  and  hence  have  common  survival 
probability  values.  The  MOSs  within  each  category  all  have 
several  common  attributes;  specifically,  the  type  of  training 
needed  to  acquire  the  MOS,  the  type  of  duty  or  jobs  associated 
with  those  MOSs,  and  the  possible  civilian  employment 
opportunities  available  for  persons  with  these  MOSs.  MOSs  are 
also  grouped  to  increase  the  personnel  inventory  within  each 
paygrade  required  for  a  survival  analysis.  Chapter  II 
contains  an  explanation  on  these  MOS  groups  and  examines  the 
data  used  in  this  study. 

For  most  MOSs  stocks  are  large  and  empirical  survival 
probabilities  can  be  used.  Modelled  survival  probabilities 
are  needed  if  personnel  stocks  are  smaller  than  preferred. 
Small  personnel  stocks  are  subject  to  large  data  variability; 
models  are  developed  to  provide  stability.   Paygrades  E-8  and 


E-9  could  be  considered  too  small  to  use  empirical  survival 
probabilities,  thus  regression  models  are  examined  in  Chapter 
IV  for  these  paygrades. 

A  model  that  projects  an  expected  number  (and  variance)  of 
monthly  attritions  will  be  developed  and  demonstrated  in 
Chapter  V.  This  model  is  based  on  principles  of  the  Binomial 
distribution.  Paygrades  will  be  partitioned  several  times  to 
produce  multiple  binomial  experiments  from  which  expected 
values  and  variances  can  be  calculated.  The  product  limit 
estimates  generated  by  the  survival  analysis  will  correspond 
to  the  probabilities  of  success  associated  with  a  binomial 
experiment . 

Finally  conclusions  and  recommendations  are  contained  in 
Chapter  VI.  Since  this  is  the  first  time  in  recent  years  that 
enlisted  attrition  has  been  viewed  using  operations  analysis 
tools,  most  of  the  conclusions  should  be  unique.  It  is 
expected  that  additional  studies  utilizing  more  refined  data 
will  be  needed  to  better  understand,  analyze,  and  predict  CG 
enlisted  attrition. 


II.   DATA  OVERVIEW 

A.   POPULATION 

Data  for  this  thesis  were  provided  by  the  CG  Pay  and 
Personnel  Center  located  in  Topeka,  Kansas.  The  data  were 
provided  in  two  sets.  The  first  set  contained  individual 
personnel  records  for  all  persons  who  separated  during  FY  91 
(1  November  1990  -  31  October  1991)  .  The  second  set  contained 
individual  personnel  records  for  all  enlisted  persons  who  were 
on  active  duty  the  last  day  of  FY  91  (31  October  1991)  .  This 
second  data  set  is  in  a  sense  a  "snapshot"  of  the  GC  EM 
composition  on  31  October  1991.  In  both  data  sets  each 
personnel  record  contained  the  persons  sex,  MOS  (if 
applicable)  ,  rank,  marital  status,  ethnic  origin,  date  of 
current  enlistment,  and  date  of  separation  (if  applicable)  . 
Each  persons  length  of  service  (in  months)  can  be  calculated 
from  the  data  entries  date  of  current  enlistment  and  date  of 
separation.  It  should  be  noted  that  the  entry,  date  of 
current  enlistment,  did  not  necessarily  correspond  to  when  the 
person  first  joined  the  CG.  This  affects  the  calculation  of 
a  persons  time  in  service.  In  certain  cases  the  CG  allows  an 
EM  to  separate  from  the  CG  and  then  allows  the  person  to 
rejoin  the  CG  in  the  future  at  his  or  her  past  paygrade.  Such 
persons'  past  service  time  is  not  ignored.    The  person 


maintains  all  accumulated  service  time  for  pay  and  retirement 
purposes.  This  condition  was  not  reflected  in  the  data 
provided  for  this  study.  There  are  numerous  records  of 
senior  EMs  having  relatively  little  time  in  service  when  in 
all  likelihood  they  have  quite  a  bit  of  service  time.  This 
situation  is  most  prominent  in  the  survival  analysis  results 
of  Chapter  III.  Appendix  A  contains  procedures  and  Fortran 
programs  needed  to  access  and  use  the  data. 

B .  LIMITATIONS 

Analyzing  only  one  years  amount  of  personnel  data  presents 
two  problems.  The  first  and  foremost  is  that  the  data  lack 
temporal  stability.  More  than  one  years  worth  of  data  is 
required  to  provide  stability.  Attrition  study  conclusions 
and  modelling  results  based  on  unstable  data  are  not  as 
reliable  as  if  they  were  based  on  time  tested  data.  Secondly, 
the  data  may  not  have  been  as  reliable  as  preferred  because 
data  inputs  could  not  be  verified.  The  FY  91  data  were  placed 
in  storage  by  the  Coast  Guard  PPC  and  were  not  accessible 
during  the  study.  Recommendations  contained  in  Chapter  VI 
address  these  shortcomings  further. 

C.  MOS  GROUPS 

As  stated  in  Chapter  I  MOS  groups  were  formed  because  of 
common  attributes  associated  within  each  group.  These 
attributes  are  listed  in  this  section. 
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1.  Aviation 

Individuals  with  aviation  MOSs  have  common  A- school 
characteristics  and  common  duty  stations.  (The  CG  has  three 
basic  types  of  duty  stations:  ashore,  afloat,  and  air 
stations.)  The  aviation  A-schools  (specialty  school)  waiting 
lists  are  relatively  long  compared  to  the  other  MOS  groups. 
This  lengthens  a  persons  service  time  in  paygrade  E-3,  thus 
increasing  the  amount  of  time  needed  for  promotion  to  E-4. 
Unlike  many  MOSs,  individuals  within  this  group  are  limited  in 
their  types  of  duty  stations.  They  spend  most  of  their  entire 
career  at  CG  air  stations. 

2.  Administrative 

These  MOSs  are  again  grouped  because  of  their  A- school 
characteristics,  choice  of  duty  locations,  and  possible 
civilian  employment  opportunities.  The  administrative  A- 
schools  have  short  waiting  lists  and  short  school  durations. 
Promotion  to  E-4  in  this  MOS  group  is  relatively  fast  compared 
to  other  groups.  Administrative  persons  also  can  serve  in 
almost  any  CG  duty  station  or  command.  Since  these  persons 
skills  are  mainly  clerical  in  nature  it  is  felt  that  their 
civilian  employment  opportunities  are  limited. 

3 .  Technical 

Persons  with  technical  MOSs  share  the  same  type  of 
characteristics  as  the  administrative  MOS  group,  but  with  two 
opposite  attributes.  Their  A-schools  have  long  waiting  lists, 
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are  long  in  duration,  and  have  low  graduation  rates  (as 
compared  to  the  other  MOS  group  A- schools)  .  Like  the 
administrative  group,  persons  in  this  group  can  serve  in 
almost  any  CG  command.  However,  unlike  the  above  group, 
technically  grouped  persons  probably  have  more  civilian  job 
opportunities . 

4.  Skilled 

This  group  is  characterized  by  their  scope  of  possible 
civilian  employment  opportunities.  Persons  within  this  group 
probably  have  marketable  skills,  possibly  not  as  marketable  as 
the  technical  group  but  more  marketable  than  any  other  rate 
group . 

5.  CG  Professional 

This  group  is  characterized  by  MOS  obtainment  and 
employment  opportunities.  As  discussed  in  Chapter  I,  MOSs  are 
obtained  by  graduating  from  an  A- school  or  by  completing  an 
on- the  job  training  program.  This  group  is  the  only  MOS  group 
that  incorporates  on-the-job  training  programs  for  a  number  of 
their  MOSs.  MOSs  are  also  designed  for  CG  specific  functions 
(ie.  shipboard  machinists,  CG  weapons  technicians,  Shipboard 
navigators,  etc.).  These  skills  have  limited  value  for 
c  i vi 1 ian  empl oyment . 
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D.   SUMMARY  OP  PERSONNEL  STOCKS 

Table  2  provides  a  summary  of  personnel  stocks 
partitioned  by  the  rank,  sex,  race,  and  marital  status  for  all 
CG  EMs  who  were  on  active  duty  as  of  31  October  1991. 


TABLE  2 

.   ACTIVE  DUTY  PERSONNEL 

STOCKS  FY  91 

Pay- 
grade 

Total 

Male 

Female 

Cauc. 

Non- 
Cauc. 

Mar. 

Not 
Mar. 

E-l 

732 

622 

110 

595 

137 

68 

664 

E-2 

3383 

3027 

356 

2776 

607 

622 

2761 

E-3 

3651 

3358 

293 

3031 

620 

1256 

2395 

E-4 

7064 

6364 

700 

6043 

1021 

3641 

3423 

E-5 

5904 

5410 

494 

4830 

1074 

4278 

1626 

E-6 

5748 

5427 

321 

4878 

870 

4805 

943 

E-7 

2764 

2696 

68 

2436 

328 

2436 

328 

E-8 

569 

564 

5 

501 

68 

517 

52 

E-9 

289 

289 

0 

256 

33 

270 

19 

Total 

30,104 

27,757 

2347 

25,346 

4758 

17,893 

12,211 

Note  that  race  is  divided  into  two  categories:  Caucasian 
and  non- Caucasian.  There  are  relatively  few  (as  compared  to 
the  total  EM  population)  minorities  in  the  CG.  All  minorities 
were  grouped  to  increase  the  personnel  stocks  within  the  race 
category.  As  can  be  seen  the  CG  is  a  male,  Caucasian 
dominated  service.  Table  3  on  page  14  provides  the  same 
statistics  for  Coast  Guardsmen  who  separated  during  FY  91. 
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TABLE  3 

.   PERSONNEL  STOCKS  FOR  PERSONS  SEPARATING  DURING  FY  91 

Pay- 
grade 

Total 

Male 

Female 

Cauc. 

Non 

Cauc . 

Married 

Not 
Married 

E-l 

779 

637 

142 

601 

178 

83 

696 

E-2 

522 

438 

84 

429 

93 

115 

407 

E-3 

831 

731 

100 

712 

119 

276 

555 

E-4 

1248 

1094 

154 

1079 

169 

602 

646 

E-5 

438 

398 

40 

402 

36 

281 

157 

E-6 

385 

369 

16 

348 

37 

317 

68 

E-7 

315 

305 

10 

271 

44 

277 

38 

E-8 

79 

77 

2 

73 

6 

73 

6 

E-9 

46 

46 

0 

45 

1 

39 

7 

Total 

4643 

4095 

548 

3960 

683 

2063 

2850 

Approximately  the  same  proportions  of  males  to  females, 
Caucasians  to  non- Caucasians,  and  married  persons  to  persons 
not  married  separated  from  the  CG  as  did  those  who  did  not 
separate  during  FY  91. 

Table  4  on  page  15  provides  on  a  summary  of  personnel 
stocks  for  FY  91  active  duty  personnel  by  MOS  group. 
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TABLE  4.   FY  91  ACTIVE  DUTY  PERSONNEL  STOCKS  BY  MOS  GROUP 


Paygrade 

Aviation 

Admin. 

Technical 

Skilled 

CG  Pro. 

E-3 

20 

41 

84 

71 

333 

E-4 

1142 

970 

1031 

715 

3206 

E-5 

921 

949 

781 

490 

2763 

E-6 

748 

899 

837 

527 

2737 

E-7 

262 

353 

452 

260 

1437 

E-8 

77 

88 

98 

55 

251 

E-9 

43 

40 

38 

18 

150 

Total 

3213 

3340 

3321 

2136 

10,877 

The  numbers  in  paygrade  E-3  are  small  because  not  all  E-3s 
have  an  MOS.  Overall  approximately  48%  of  all  personnel 
belong  to  the  CG  professional  MOS  group.  About  14%  belong  in 
the  aviation  group;  15%  belong  in  the  administrative  group; 
14%  belong  in  the  technical  group;  and  9%  belong  in  the 
skilled  rate  group.  Percentages  within  each  paygrade  are 
roughly  the  same.  Table  5  on  page  16  provides  the  same 
statistics  for  personnel  who  separated  during  FY  91. 
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TABLE  5.   FY  91  SEPARATION  PERSONNEL  STOCKS  BY  MOS  GROUP 


Paygrade 

Aviation 

Admin. 

Technical 

Skilled 

CG  Pro. 

E-3 

1 

5 

12 

23 

65 

E-4 

75 

157 

180 

100 

733 

E-5 

26 

49 

91 

26 

246 

E-6 

31 

46 

51 

36 

221 

E-7 

22 

51 

23 

26 

193 

E-8 

11 

15 

14 

8 

31 

E-9 

3 

6 

3 

3 

31 

Total 

169 

328 

373 

221 

1518 

E.   DATA  PARTITIONING 

The  data  was  partitioned  by  rank  to  form  9  paygrades. 
Each  paygrade  was  further  divided  by  the  personnel 
characteristics  sex,  race,  marital  status,  and  MOS  groups 
listed  in  Table  1  on  page  7. 

1.  Paygrades  E-l  and  E-2 

These  paygrades  were  divided  by  the  sex,  race,  and  the 
marital  status  of  the  individual.  Marital  status  indicated 
whether  or  not  a  person  was  currently  married. 

2 .  Paygrade  E-3 

In   addition   to   the   above,   this   paygrade   was 
partitioned  by  the  obtainment  of  an  MOS.   The  person  either 
had  obtained  one  of  the  MOSs  listed  in  Chapter  I  or  had  not 
(nonrated) . 
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3.  Paygrades  E-4  through  E-6 

These  paygrades  were  also  partitioned  by  sex,  race, 
and  marital  status.  Additionally  they  were  partitioned  by  MOS 
groups . 

4.  Paygrades  E-7  through  E-9 

These  paygrades  were  divided  by  race,  marital  status, 
and  MOS  group.  Sex  was  not  examined  because  of  the  scarcity 
of  females  in  these  paygrades. 
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III.   SURVIVAL  ANALYSIS 

A.  INTRODUCTION 

As  stated  in  Chapter  I,  survival  analysis  provides  a 
method  for  analyzing  censored  failure  time  data.  Failure 
times  in  this  study  refer  to  the  number  of  months  a  person 
serves  on  active  duty  prior  to  separation.  Failure  times  are 
assumed  to  be  discrete  and  are  rounded  up  to  the  next  whole 
month  figure.  For  example,  if  a  person  separates  after  31 
months,  six  days  of  service  his  or  her  failure  time  is  32 
months.  This  chapter  reviews  the  theory  of  survival  analysis 
and  graphically  compares  survivor  functions  of  individual 
paygrades  that  have  been  partitioned  by  single  personnel 
characteristics.  Appendix  C  contains  the  SAS  LIFETEST 
procedure  and  Fortran  code  required  to  estimate  the  survivor 
functions. 

B.  COMPUTATION  OF  SURVIVOR  FUNCTIONS 

Let  T  be  a  nonnegative  random  variable  representing  the 
separation  time  of  an  EM  from  a  homogeneous  population.  The 
survivor  function  is  defined  to  be  the  probability  that  T  is 
at  least  as  great  as  some  time,  t, 
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S(t)  =Pr(T*t)  ;  0  <;  t  <  »   .  (3.1) 


From  a  simple  random  sample,  this  can  be  estimated  by 


git)  -i  -  f  num^ez  °f  observations  with  failure  time  4  t,     ,^   2, 

total  number  of  observations 


If  T  is  a  discrete  random  variable  taking  values  xt  <  Xj  ...  < 
Xj,  with  associated  probability  function  f  (Xj)  =  Pr(T=Xi),  then 

5(t)=J]f(xi)if(xi-t)/  (3.3) 

i=l 

where  H  is  the  Heaviside  function,  H(x)={0  if  x<0;  1  if  xaO} 
[Ref .  l:pp.  5-7] . 

The  presence  of  censored  data,  however,  weakens  the 
estimation  of  the  survivor  function  in  the  right  tail.  All 
that  can  be  safely  assumed  about  an  observation  censored  at 
time  t  is  that  the  unobserved  failure  time  is  greater  than  t. 
This  condition  is  relieved  by  using  an  alternate  formulation 
of  the  survivor  function  using  hazard  rates.  For  discrete 
random  variables,  the  hazard  rate,  Xt,  is  the  probability  that 
the  observation  will  immediately  fail  at  time  t  given  is  has 
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survived  to  time  t.    In  the  discrete  case  this  can  be 
estimated  by 


y      number  of  observations  that  fail  at  t  ,..  .» 

t    number  of  observations  with  failure  time   >  t' 


Using  hazard  rates  [Ref.  3],  the  survivor  function  may  be 
expressed  as 


S(t)=  H     (1-Aj)  .  (3.5) 


where  X{  is  the  hazard  rate  at  Xj  for  i=l,...,j.  Estimating  Xj 
would  allow  an  estimation  of  the  survivor  function,  S(t). 
Chapter  2  of  Reference  1  contains  a  detailed  explanation  of 
the  computation  of  the  estimated  survivor  function. 

C.   SURVIVAL  ANALYSIS  RESULTS 

In  all  paygrades  the  marital  status  characteristic  and  sex 
characteristic  behaviors  were  identical;  married  persons 
appeared  to  have  a  higher  probability  of  survival  than  those 
who  were  not  married,  and  males  tended  to  have  higher  survival 
probabilities  than  did  females.  (See  Figures  1  and  2  on  page 
40.)  Non- Caucasians  in  paygrades  E-3  through  E-9  had  higher 
survival  probabilities  than  did  Caucasians.  Results  for  other 
characteristics  are  given  below. 
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1.  E-l 

Caucasians  and  non- Caucasians  have  about  the  same 
survivor  functions  until  about  two  months.  After  two  months 
Caucasians  tend  to  have  higher  survival  probabilities.  (See 
Figure  3  on  page  41.) 

2.  E-2 

Caucasians  had  higher  survival  probabilities  than  did 
non- Caucasians  up  to  about  20  months.  (See  Figure  4  on  page 
41.)  After  that  non- Caucasians  had  higher  survivor 
probabilities.  Also  note  the  significant  drop  in 
probabilities  at  24  months.  The  CG  for  a  brief  time  allowed 
two  year  enlistments.  This  drop  corresponds  to  enlisted 
persons  separating  at  the  end  of  a  2  year  enlistment. 

3.  E-3 

Individuals  possessing  an  MOS  had  higher  survival 
probabilities  than  did  nonrates  after  22  months.  (See  Figure 
5  on  page  42 . ) 

4.  E-4  and  E-5 

The  aviation  MOS  group  in  both  paygrades  had  the 
highest  survival  probabilities  and  the  technical  group  had  the 
lowest.  The  remaining  three  groups  survival  functions  fell  in 
between  the  aviation  and  technical  survivor  functions.  (See 
Figure  6  on  page  42 . ) 
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5.  E-6 

All  MOS  groups  in  this  paygrade  were  very  similar  as 
shown  in  Figure  6  on  page  42.  All  had  very  high  probabilities 
of  survival  and  all  were  very  similar  in  shape.  Again  the 
aviation  group  had  the  highest  survival  probabilities. 

6.  E-7,  E-8,  and  E-9 

The  survivor  functions  in  all  three  paygrades  appeared 
fairly  flat  with  slightly  decreasing  slopes  until  about  220 
months.  (See  Figure  7  on  page  43.)  After  220  months  all 
survivor  functions  decreased  at  about  the  same  rate.  All 
survivor  functions  within  these  paygrades  are  characterized  by 
high  survival  probabilities  with  the  aviation  group  having  the 
highest  probabilities  of  survival. 
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IV.   MODELLED  SURVIVOR  FUNCTIONS 

A.  INTRODUCTION 

Regression  models  are  typically  used  to  model  survival 
experiences  from  homogeneous  populations  [Ref.  1]  .  These 
models  allow  for  failure  times  to  be  dependent  on  observed 
explanatory  variables.  These  explanatory  variables  can  be 
either  qualitative  or  quantitative.  Two  common  regression 
models  are  the  exponential  regression  model  and  the  weibull 
regression  model.  Weibull  regression  models  will  be 
constructed  and  analyzed  for  paygrades  E-8  and  E-9.  The 
primary  reference  for  this  chapter  is  Chapter  2  and  Chapter  4 
of  Reference  1. 

B.  WEIBULL  REGRESSION  MODEL 
1.   Weibull  Distribution 

The  Weibull  distribution  is  a  generalization  of  the 
exponential  distribution  that  allows  for  a  power  dependence, 
p,  of  the  hazard  of  time.  The  Weibull  distribution  is  in  fact 
an  exponential  distribution  when  p=l.  The  probability  density 
function  (pdf)  of  the  Weibull  is 

f(t)=Xp(kt)P-1e(-{Xt)P)     for   t*0.         (4.1) 
The  hazard  function  is 
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X(t)=Xp(\t)p-x  (4.2) 

and  the  survivor  function  is 

5(t)=e(-Ut)*)  m  (4>3) 

A  convenient  check  to  verify  that  data  fits  the 
Weibull  distribution  is  to  plot  the  log [-log  S(t)]  versus  log 
t(S(t)  is  the  empirical  estimate  of  the  survivor  function). 
If  the  data  can  be  fitted  by  the  Weibull  distribution,  the 
plot  should  provide  a  relatively  straight  line  with  the  slope 
of  the  line  roughly  estimating  the  value  of  p. 

2.   Log-Linear  Regression  Model 

The  Weibull  distribution  can  be  generalized  to  obtain 
a  Weibull  regression  model  by  allowing  the  failure  rate  to  be 
a  function  of  a  vector  of  qualitative  personnel 
characteristics,  X.  In  this  application  X  is  a  row  vector 
consisting  of  the  binary  variables  Xaviation,  Xadminigtrative ,  X,^^,  Xgkilkd, 
and  Xcgprofe„iooal  which  take  on  a  value  of  1  if  the  observation 
falls  into  the  subscripted  category,  and  are  0  otherwise. 

If  the  conditional  (on  X)  hazard  is 


\(t\X)=\p(\t)P-1exfi 
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then  the  conditional  density  of  T  is 

f  (t\X)  =Xp(Xt)'-1e^e-[(X')'^]  (4.4) 

where  /8=  {@lt  .  .  .  ,/3n)  is  a  vector  of  regression  parameters.  In 
terms  of  Y=log  T,  equation  (4.4)  is  the  linear  model 

Y=a   +  X(3'  +  aW  (4.5) 

where  a=-log  X,  a=p"1,  @*=-aP,  and  W  has  the  extreme  value 
distribution.  In  equation  (4.5)  the  personnel  characteristics 
act  additively  on  the  time  scale,  vice  multiplicatively  in 
equation  (4.4).   [Ref.  1] 

3.   Modelled  Survivor  Function 

Equation  (4.5)  is  known  as  an  accelerated  failure  time 
model.  It  is  characterized  by  the  scaling  of  the  failure  time 


T=e'x&T0 


where  T0  is  a  failure  time  from  a  baseline  when  the  values  of 
X  are  all  zero.  Using  this  time  scaling  the  modelled  survivor 
function  is 
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S(t)=e-^""ty  (4.6)  . 

In  equation  (4.6),  S(t)  can  be  estimated  by  inserting 
appropriate  values  for  X  and  t,  and  estimated  values  of  /3,  X, 
and  p.  This  will  produce  a  modelled  survivor  function  which 
can  be  plotted  to  produce  curves  like  those  in  Chapter  III. 
If  this  modelled  survivor  function  is  similar  (in  shape  and 
magnitude)  to  the  empirical  survivor  function,  then  the  model 
can  be  used.  If  it  is  not  similar,  then  it  possibly  should 
not  be  used. 

C .   RESULTS 

The  partitioned  data  sets  were  regressed  using  the  SAS 
LIFEREG  procedure  [Ref.  4]  to  obtain  estimates  for  /?,  X,  and 
p.  These  values  along  with  appropriate  values  for  X  were 
inserted  into  equation  (4.6)  to  estimate  modelled  survivor 
functions.  Figures  8  through  13  on  pages  43  through  46 
contain  plots  of  these  modelled  functions  against  empirical 
survivor  functions.  Generally  the  modelled  survivor  functions 
do  not  fit  the  data  as  well  as  desired. 

Nonetheless,  modelled  survivor  functions  for  these  two 
paygrades  will  be  used  pending  the  development  of  sharper 
alternatives.  The  partitioned  data  sets  are  so  small,  it  is 
felt  that  a  model  will  produce  the  most  reliable  and  stable 
probability  estimates. 
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V.   BINOMIAL  COUNTING  MODEL 

A.  INTRODUCTION 

The  Binomial  Distribution  provides  a  means  for  calculating 
expected  values  and  variances  of  sums  of  random  variables  that 
have  one  of  two  outcomes:  success  or  failure.  This  study  is 
concerned  with  estimating  the  number  of  enlisted  who  separate 
on  any  given  month  (or  conversely  the  number  who  do  not 
separate).  This  chapter  develops  a  simple  model,  based  on 
principals  of  the  Binomial  Distribution,  that  calculates 
monthly  survival  figures  per  paygrade,  and  provides  an  example 
of  how  the  model  works.  The  primary  reference  is  Chapter  3  of 
Reference  5. 

B.  BINOMIAL  DISTRIBUTION 

A  binomial  experiment  is  an  experiment  that  possesses  the 
following  properties: 

-  the  experiment  consists  of  n   identical  trials, 

-  each  trial  results  in  one  of  two  outcomes,  a  success  or 
failure, 

-  the  probability  of  success  on  a  single  trial  is  equal  to 
p  and  remains  the  same  from  trial  to  trial, 
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-  trials  are  independent,  and 

-  the  random  variable  of  interest  is  Y,    the  number  of 
successes  in  n   trials.   The  expected  value  of  Y  is 
E(Y)=np,  where  n  is  the  number  of  trials  and  p  is  their 
corresponding  probability  of  success.   [Ref.  5] 

A  person  separating  from  the  CG  meets  these  standards  and 
can  be  considered  a  Bernoulli  trial.  Each  term  trial  results 
in  one  of  two  outcomes.  A  success  is  marked  if  the  person 
does  not  separate,  a  failure  when  the  person  leaves  the  CG. 
The  product -limit -estimate  of  the  survivor  function  provides 
the  probability  of  a  person  surviving  to  and  beyond  a  time,  t; 
P(Tst)  for  t=l,...oo.  This  probability  estimate  remains  the 
same  for  each  trial  (ie.  for  all  persons  with  length  of 
services  equal  to  time  t)  .  The  decision  by  a  person  to 
separate  (or  the  CGs  decision  to  separate  a  person)  is  not 
affected  by  another  persons  choice  and  thus  independent  of  any 
other  persons  decision.  And  finally,  Y  equals  the  number  of 
persons  not  separating  within  a  paygrade  who  have  survived 
beyond  a  time  t.  Using  these  definitions  a  basic  model  can  be 
constructed  that  generates  an  expected  value  and  variance  for 
the  number  of  persons  that  do  not  separate  on  a  given  month. 
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C.   COUNTING  MODEL 

Paygrades  are  partitioned  in  two  steps  to  form  data  cells 
that  consist  of  independent  and  identical  trials  required  of 
the  Binomial  Distribution.  The  first  step  separates  each 
paygrade  into  the  single  personnel  characteristic  examined  in 
the  survival  analysis.  These  will  be  called  sets.  The  second 
step  further  partitions  the  paygrade  by  the  amount  of  time 
each  person  has  served,  or  Time  in  Service  (TIS)  cells. 
Figure  14  provides  a  tree  diagram  example  of  this  partitioning 
scheme. 


PAYGRADE 

SETi 

SET2 

'1 

CELLi 

CELLa 

CELLt 

•  ♦ 

1 

CELLi 

CELLa 

CELLt 

Figure  14.   Counting  Model  Partitioning  Scheme 


T  is  the  maximum  TIS  any  one  person  has  within  each  set. 
Paygrades  E-l  and  E-2  are  partitioned  by  race  to  form  two  data 
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sets  each;  paygrade  E-3  is  partitioned  by  MOS  obtainment;  and 
paygrades  E-4  through  E-9  are  partitioned  by  MOS  group.  Each 
cell  has  a  corresponding  probability  of  success,  Pj,  (either 
estimated  from  the  actual  data  or  generated  by  a  model)  which 
was  defined  in  Section  4.D.1  to  be  the  probability  of  survival 
being  greater  than  or  equal  to  t.  Given  nj,  the  expected 
number  of  persons  who  fall  into  cell  j  ,  the  expected  number  of 
successes  (persons  that  do  not  separate)  in  each  cell  is 

EiYj'zyPj     ;  j=l,...,T.  (4.7) 

The  cell  variance  is 

VariYJ^npft  (4.8) 


where  q^l-pj.   The  expected  number  of  successes  within  each 
set  is  the  sum  of  the  cell  expected  values 


T 

e^ysJ=Y,e{yj)  (4-9) 
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and  by  summing  cell  variances  the  set  variance  is 


T 

Var{Ytet)^Var(Yj)  (4.10) 


The  expected  number  of  successes  per  paygrade  is  calculated  by 
summing  set  expected  values .  Paygrade  variance  is  the  sum  of 
all  set  variances. 

Appendices  C  and  D  contain  procedures  for  generating 
product  limit  estimates  which  correspond  to  the  p^s. 

D .   EXAMPLE 

Data  from  paygrade  E-l  is  used  to  illustrate  this  counting 
model.  Paygrade  E-l  contains  2  paygrade  sets.  Setj  contains 
records  for  Caucasians  and  Set2  contains  records  for  non- 
caucasians.  Table  6  and  Table  7  on  page  32  contain  frequency 
counts  of  the  number  of  persons  falling  within  each  cell  per 
set . 
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TABLE  6.   PAYGRADE  CELL  FREQUENCY  COUNT  CAUCASIAN  E-l 


Month 

Freq 

Month 

Freq 

Month 

Freq 

Month 

Freq 

1 

53 

11 

2 

27 

1 

39 

3 

2 

339 

12 

3 

30 

2 

40 

2 

3 

148 

15 

3 

31 

1 

42 

2 

4 

8 

16 

2 

32 

1 

43 

1 

5 

2 

21 

4 

34 

1 

48 

1 

6 

1 

23 

3 

36 

1 

60 

1 

7 

1 

25 

2 

38 

1 

>60 

5 

TABLE  7.   PAYGRADE  CELL  FREQUENCY  COUNT  NON - CAUCAS I AN  E-l 

Month 

Freq 

Month 

Freq 

Month 

Freq 

Month 

Freq 

1 

2 

4 

6 

16 

1 

52 

1 

2 

87 

13 

1 

18 

1 

>52 

0 

3 

35 

14 

1 

20 

1 

Tables  8  and  Table  9  on  page  33  contain  the  estimated  Pj  values 
for  each  data  sets. 
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TABLE  8. 

PRODUCT  LIMIT  ESTIMATES  PAYGRADE  E-l  CAUCASIAN 

Month 

P 

Month 

P 

Month 

P 

1 

.88 

15 

.36 

27 

.256 

2 

.645 

16 

.345 

28 

.243 

3 

.5 

17 

.337 

29 

.23 

4 

.468 

18 

.33 

30 

.225 

5 

.441 

19 

.326 

31 

.22 

6 

.431 

20 

.318 

32 

.21 

7 

.424 

21 

.314 

33 

.2 

8 

.417 

22 

.294 

35 

.2 

11 

.417 

23 

.294 

36 

.1 

12 

.393 

24 

.286 

48 

.1 

13 

.389 

25 

.265 

61 

.05 

14 

.375 

26 

.256 

62 

0 

TABLE  9. 

PRODUCT  LIMIT  ESTIMATES  PAYGRADE  E-l  NON 

-CAUCASIAN 

Month 

P 

Month 

P 

Month 

P 

1 

.889 

13 

.225 

37 

.107 

2 

.646 

14 

.213 

39 

.107 

3 

.444 

15 

.173 

40 

.089 

4 

.403 

16 

.173 

43 

.089 

5 

.368 

17 

.173 

44 

.071 

6 

.344 

18 

.159 

48 

.071 

7 

.344 

19 

.159 

49 

.054 

8 

.332 

20 

.143 

64 

.054 

10 

.332 

30 

.143 

65 

.027 

11 

.249 

31 

.125 

66 

0 

12 

.249 

36 

.125 
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Substituting  hj  the  values  from  tables  6  and  7  for  the  nj 
values  and  pj  the  estimated  probability  values  from  Tables  8 
and  9  into  equation  (4.9),  the  estimated  expected  value  of 
each  set  is 


60 

E(*i)  =£iy3,.=353.475 


52 

E(y2)  =524^=76.905 


where  set!  corresponds  to  Caucasians  and  set2  corresponds  to 
non- Caucasians.  Using  equation  (4.10)  the  Var(Yj)  =  129.5  and 
Var(Y2)  =  30.97.  The  overall  paygrade  estimated  expected 
value  and  variance  is  the  sum  of  the  set  values 


BOW  =£E(r,)  =430.38 


/=i 


2 

I 


Var(YE_x)  =52  VariYf)  =160.47. 


For  working  purposes  these  values  are  rounded  down, 
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VI.   CONCLUSIONS  AND  RECOMMENDATIONS 

A.   CONCLUSIONS 

There  are  approximately  30,100  enlisted  persons  in  the  CG 
at  any  given  time.  A  method  is  needed  to  predict  monthly 
attrition  rates  to  maintain  the  CG  enlisted  hierarchical 
structure.  Survival  analysis  was  used  to  examine  the 
characteristics  of  sex,  marital  status,  race,  and  MOS  group 
behavior  of  CG  enlisted  personnel.  The  results  were  fairly 
interesting.  Males  and  married  persons  tended  to  have  higher 
probabilities  of  survival  than  did  females  and  individuals  who 
were  not  married.  Non- Caucasians  in  most  of  the  paygrades  had 
higher  survival  probabilities  than  did  Caucasians.  MOS  groups 
demonstrated  different  characteristic  behaviors  among 
different  paygrades.  MOS  groups  in  paygrades  E-4  and  E-5 
behaved  about  the  same  up  to  48  months.  After  48  months  all 
groups  showed  a  rapid  decline  in  survivor  probabilities  with 
the  aviation  group  having  the  highest  probabilities  and  the 
technical  group  having  the  lowest.  This  drop  is  indicative  of 
the  end  of  a  four  year  enlistment  when  individuals  can  usually 
separate  from  the  CG.  MOS  groups  in  paygrades  E-6  through  E-9 
all  had  very  high  probabilities  of  survival  and  demonstrated 
a  decline  at  certain  months.  E-6  showed  this  decline  at  about 
200  months  and  E-7  through  E-9  demonstrated  this  at  about  240 
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months.  The  drop  in  probabilities  at  240  months  is  easily 
explained.  Many  Coast  Guardsmen  retire  at  20  years  and 
receive  a  substantial  pension  and  many  important  benefits  (ie. 
medical  coverage  for  the  person  and  his  or  her  dependents)  . 
The  drop  at  200  months  in  paygrade  E-7  is  an  anomoly.  An  E-7 
at  this  point  in  his/her  career  has  almost  17  years  of  service 
time  and  has  only  about  three  more  years  to  serve  before 
retirement.  Separating  at  17  years  seems  to  be  financially 
inappropriate  because  the  person  will  not  receive  a  pension 
nor  benefits  if  they  separate. 

Modelled  survivor  functions  were  developed  for  paygrades 
E-8  and  E-9  because  these  paygrades  possessed  small  population 
sizes.  Despite  the  fact  that  the  models  produced  generally 
poor  fits,  it  was  felt  that  these  survivor  functions  would  be 
more  reliable  than  survivor  functions  generated  from  very 
small  data  sizes.  Therefore  models  were  used  to  calculate 
attrition  in  paygrades  E-8  and  E-9. 

The  binomial  counting  model  provides  an  effective  tool  for 
calculating  monthly  attrition  figures.  Expected  values  and 
variances  can  be  calculated  by  partitioning  each  paygrade  into 
several  binomial  experiments .  This  model  is  easy  to  use  and 
can  be  expanded  in  future  studies. 

B .   RECOMMENDATIONS 

As  a  first  study  the  goals  of  this  thesis  were  met. 
However  further  studies  are  required  to  better  analyze  CG 
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enlisted  attrition.  The  counting  model  could  be  adjusted  or 
refined  based  on  findings  from  the  recommendations  listed  in 
this  section. 

1.  Recommendation  1 

The  counting  model  needs  to  be  verified.  Personnel 
data  from  early  months  of  FY  92  need  to  be  processed  through 
the  model  to  verify  that  it  provides  better  results  than  the 
current  calculation  method. 

2 .  Recommendation  2 

Additional  historical  data  needs  to  be  analyzed  to 
provide  temporal  stability  to  the  study.  The  counting  model 
produced  in  this  study  assumes  that  the  characteristic 
behaviors  of  individuals  identified  in  FY  91  will  repeat 
themselves  in  FY  92. 

3 .  Recommendation  3 

A  more  manageable  data  base  needs  to  be  identified 
that  allows  for  accessible  man-machine  interface.  An 
accessible  data  base  allows  the  analyst  to  verify,  manipulate, 
and  easily  examine  data  entries  as  an  on- going  task.  This 
provides  data  reliability.  The  CG  PPC  is  not  responsible  for 
maintaining  this  type  of  data  base,  thus  PWP  should  examine 
this  issue. 

4.  Recommendation  4 

Individual  MOSs  within  MOS  groups  need  to  be  examined 
to  ensure  they  demonstrate  similar  attrition  behaviors. 
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5 .  Recommendation  5 

Combined  personnel  characteristics  need  to  be  analyzed 
for  survival  behaviors  and  seasonal  attrition  trends  need  to 
be  examined . 

6 .  Recommendation  6 

The  data  base  needs  to  be  extended  in  time.  At 
present  there  is  no  justification  for  assuming  time 
homogeneity  of  rates.  Such  an  extension  would  also  allow  the 
study  of  some  important  exogenous  variables,  such  as 
unemployment  rate  and  inflation. 

7 .  Recommendation  7 

The  educational  levels  of  Coast  Guardsmen  should  be 
analyzed  for  their  effect  on  attrition  behaviors.  High  school 
graduates,  alternate  high  school  degree  holders,  and  non-high 
school  graduates  need  to  be  considered  to  determine  if  there 
are  differences  in  attrition  behaviors  amongst  these  groups. 
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APPENDIX  A.   PROCEDURES  FOR  ACCESSING  DATA 

1.  Data  is  stored  on  two  magnetic  disks.   To  access  disks: 

a.  login  to  mainframe  computer 

b.  type  "getmvs  mvs215  (separation  data  tape) "  or 
"getmvs  mvs217"  (active  duty  data  tape) 

2.  To  run  a  program  that  has  a  huge  output  file,  ie.  a  file 
that  will  not  fit  on  an  A-disk: 

a.  type  "tdisk  6"  to  get  a  temporary  disk  with  6 
cylinders 

b.  switch  A-disk  with  temporary  disk  (  say  temp  disk  was 
assigned  "D  130" 

1.  type  "access  130  A" 

2.  type  "access  191  D" 

c.  copy  program  from  the  now  D-disk  to  the  newly  created 
A-disk 

d.  run  program 

e.  switch  disks  back  when  finished  by  reversing  procedure 

in  2.b 
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APPENDIX  B.   FIGURES 
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Figure  1.   Survivor  Function  Paygrade  E-1  by  Marital  Status 
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Figure  2.   Survivor  Function  Paygrade  E-4  by  Sex 
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Figure  3.   Survivor  Function  Paygrade  E-1  by  Race 
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Figure  4.   Survivor  Function  Paygrade  E-2  by  Race 
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SURVIVAL  FUNCTION  PAYCRADE  E-3  BY  RATE 
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Figure  5.     Survivor  Function  Paygrade  E-3  by  Rate  Obtainment 
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Figure  6.   Paygrades  E-4,  E-5,  E-6  by  Rate  Group 
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SURVIVAL  FUNCTIONS  BY  RATE  GROUP 
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Figure  7.   Paygrades  E-7,  E-8,  E-9  by  Rate  Groups 
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Figure  8.   Model  Fit  Paygrade  E-8 
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PAYGRADE  E-8 
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Figure  9.   Model  Fit  Paygrade  E-8 
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Figure  10.   Model  Fit  Paygrade  E-8 
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PAYGRADE  E-9 
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Figure   11.      Model   Fit  Paygrade  E-9 
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Figure  12.      Model   Fit  Paygrade  E-9 
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Figure   13.      Model   Pit   Paygrade  E-9 
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APPENDIX   C.       PROCEDURES    FOR   GENERATING   SURVIVAL    FUNCTIONS 


FILE:    LIFETEST   C0DC2  Al 


TITLE'E-2   IV  DACE'    /•    THIS   SAS  CODE    TAKES   A   DATA   SET   AND   GENERATES 
SURVIVOR   FUNCTIONS   FOR    THE   DATA.    THIS   CODE   HAS   USED  FOR   PAYGRADE   E-2 
AND    CREATES    SURVIVAL    FUNCTIONS    FOR    CAUCASIANS    AND    NON-CAUCAS I ANS  .  ■/ I 

DATA  P02i 

INPUT   RATE   RANK   SEX   ACE   HAR   RACE   MONTHS   CEN   /"THIS   LINE   DEFINES   THE 
VARIABLES    MITHIN    THE    DATA    SET. '/I 

GROUPi  (RACE'S)    /'THIS    SEPARATES    ALL    CAUCASIANS    FROM    NON-CAUCASIANS. 
S    INDICATES   CAUCASIAN.    THIS  SPLITS   THE   DATA   SET    INTO   2   SETi    1    CONTIANS 
CAUCASIANS.    THE   OTHER   CONTAINS   ALL   OTHER   MINORIT ITES. "/I 

CARDS   /'TELLS   SAS   THAT   THE   DATA  SET   FOLLOWS.    DATA   SET   MUST    BE   PLACED 
IMMEDIATELY    AFTER    THIS    LINE    AFTER    THE    SEMICOLON.     AND    MUST    HAVE    A 
SEMICOLON   PLACED    IMMEDIATELY  AFTER   THE   LAST   FILE."/    I 


PROC   LIFETEST   PLOT.(S)   MAXTIME««$   OUTSERV'PO^EST   /-THIS    IS   THE   MAIN   CODE 
II    CREATES   A   SURVIVAL   FUNCTION  PLOT    (S).    AND   LIMITS    THE   X-AXIS   Or    TmE 

PLOT    TO   t8   MONTHS.    ALSO   STORE   THE   PRODUCT   LIMIT   ESTIMATES    IN   A   SAS   FILE 

CALLED   PC2EST."/        : 

TIME   MOf.TMS'CEN(l)   /"DEFINES   THE   TIME   VARIABLE    (MONTHS).    AND   SETS    THE 
CENSORING   VARIABLE   WITHIN    THE   DATA   SET   EOUAL    TO    1    •/      1 

STRATA   GROUP    /"PERFORMS   THE   PROCEDURE   ON   ALL   DEFINED  CROUPS.    IN   THIS   CASE 
ThERE   ARE   ONLY   2   CROUPS"/         I 

PROC   PRINT   DATAiPOIEST   /"PRINTS   THE   DATA   SET   P02EST    INTO  A   CMS   FIlE"/i 


FILE:    LIFETEST   CODE 


TITLE 'PAYGRADE   E-»   BY   RATE   CROUP    ■    /"SAME   AS   PREVIOUS   CODE   EXCEPT 
IT    DIVIDES    THE   PAY&RADE    INTO  S   GROUPS:    I    FOR   EACH  MOS   CROUP.    CROUP    1 
IS    TME   AVIATION  PEOPLE,    2-ADMIN    . J. TECHNICAL.    <«SKILLED.S"CG   PRO   •/    I 

DATA   PO»l 

INPUT   RATE   RANK   SEX   AGE   MAR   RACE   MONTHS   CENi 
IF    KA!E  =  SK)|RATE  =  S70IRATE'S20IRATE«SJ0IRATE--S50   THEN  GP-li 
ELSE    IF   RATE=!40lRATE«420   THEN  GP«2i 

ELSE    IF   RATE»7*0IRATE'18OIRATE>l«0IRATE«2?0IRATE'2«OIRATE-28O   THEN  GP"Sl 
ELSE    IF    RATE=S00lRATE*8«0IRATE'§70   THEN   GP««; 
ELSE   GP>S; 
CARDS: 
1 
PROC   LIFETEST   PLOT"(S)   MAXTIME'JOO      OUTSERV"PO»EST I 
TIME   MONTMS"CEN(i)i 
STRATA    GP; 
PROC    PRINT    DATA.POIESTl 
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PROGRAM    M0DFIT2 

THIS   PROGRAM   GENERATES   MODELLED   PRODUCT   LIMIT    ESTIMATES   FOR 
PArGRADES   WHICH   HAVE    BEEN  RARTITIOHED   BY   S   MOS   GPOUPSi    AVIATION. 
ADMINISTRATIVE.    TECHNICAL.    SKILLED.    CG   PROFESSIONAL.       IT   USES   AS 
INPUT   REGRESSION   COEFFICIENTS    (BETA)    THROUGH   BETAIS).    SHARE 
PARAMETERS    (PI    AND  P2).    SCALE   PARAMETERS    (LAMBDAI    AND  LAH8DA2). 
AND   A   TIME   PARAMETER    (T).    THIS   PROGRAM    IS   ALSO   DESIGNED   TO  WORK 
ON   DATA    THAT    FIT    A    3   PIECE   WEIBULL.      VARIABLES   BETA]    THRU  BETA*. 
PI.    AND   LAMBDAI    ARE   VALUES   FOR    THE   FIRST   WEIBULL   PIECE    IP    IS   THE 
TIME    VALUE   OF   WHERE    THE   FIRST   BREAK   OCCURS).    VARIABLES  BETAS   THRU 
BETAS.    P2.    AND   lambda?    (AND   M)   CORRESPOND   TO   THE   SECOND  PIECE. 
VARIABLES    BETA*    THRU  BETAI2.    PJ.    LAMBDAS    (AND  N)   CORRESPOND  TO   TM* 
LAST   WEIBULL   PIECE.      VARIABLE   BETA)]   EOUALS   2ER0   AND  CORRESPONDS 
TO   THE   CG   PROFESSIONAL   COEFFICIENT.       THE   VARIABLE   SHAT    IS   THE 
MODELLED   PRODUCT   LIMIT   ESTIMATE:    SHATI    FOR    THE   AVIATION  HOS 
GROUP.    SHAT2   FOR    ADMIN  GROUP.    SHATJ   FOR    TECHNICAL   GROUP.   SHAT*    FOR 
THE    SKILLED   GROUP.    AND   SHATS    FOR    THE   CG   PRO   GROUP.      SHAT   VALUES   AR 

CALCULATED   FOR    TIME   PERIODST.    (T-l N).    WHERE   M   1A  THE   LAST 

TIME    VALUE   NEEDED   FOR    THE   PAVGRADE. 

THE   ESTIMATES   ARE   WRITTEN   ON   THE   CMS   FILESi    FT8BF001.    FTt»F001. 

AND   FT»0F00l. 


INTEGER    I.M.N.P 

REAL   LAMBDA] .PI  .BETA] . BETA? .BETA J .BETA* .BETAS .BETA! J 
REAL   LAMBDAr.P:.BE'A*.BETA7.BETA«,BETA».BETAI0.BETAII.| 
REAL    SHAT 1 . SHAT2 . SHATS . SHAT* .SHATS .LAMBDAS .P J 

FOLLOWING   ESTABLISHES   ALL   VARIABLE   VALUES. 

PARAMETER (M. 10. LAMBDA  I i.l 00?    .P|-2.«BS> 

PARAMETERIBETAI-0.0.BETA2  — .H58.BETA5  —  .  CBS  J  .  BETA*  —  .  J 

PARAMETER(N>200    .L»MBDA2>0 . 0OO0J    .P2-«.«SS> 

PARAMETER  (BETAS' -.05*S. BETA*..  1H.BETA7.-.  OS  l».BETAB.-« 

PARAHETER(P-«20. LAMBDAS". 0028    ,PJ. 8.055«    ) 

PAR  AM£TER(  BETA*..  025*. BETA10-.  0507.  BETA11-.I04. 8ETA12  — 

PARAMETERIBETAIJ-0.0        ) 

CALCULATES   SHATS   FOR   FIRST  WEIBULL   PIECE. 

DO    10    1*1  .M 

SHATl-EXP(-(  (LAMBDAI- 1 -EXP  (-BET  All)  "PI)) 
SHAT2-EXP(-(  (LAMBDA  1-I-EXP(-BETA;))"P|>) 
SHATS  «EXP(-(  (LAMBDAI-  I-EXP(-  BETAS))  "Pin 
SHAT*.EXP(-(ILAH8DA1-I«EXP(-BETA«))"P|  J) 
SHATS  -EXP  (-((LAMBDAI- 1  -EXP  (-BETA1  5  ))"P]  )) 
WR  ITE  (88.  >1) I. SHAT I. SHAT2. SHATS. SHAT*. SHATS 
10  CONTINUE 


CALCULATES  SHATS  FOR  SECOND  WEIBULL  PIECE. 

DO  20  I.MM.N 

SHATI  -EXP  t-((L*M»DA2-l  -EXP (-BETA5))"P2)> 
SHAT2-EX°(-((LAMBD*2-I-EXP(-BETA4))--p;)) 
SHAT5-EXP(-((L»MBDA2-I-EXP(-BETA7))--P2)) 
SHAT*»EXP(-( (L«MBDA2-1-EXP(-BETA3))"P2)) 
SHATS>EXP(-((L*HBDA2-I-EXP<-BETA15))"P2)) 
WRITEre*.»l)!.SHATl.SHAT2.SHATS.SHAT*.SHATS 
20  CONTINUE 

CALCULATES  SHATS  FOR  LAST  PIECE. 

DO  JO  1-N.l.P 

shat  1 -exp  (-((lambdas- 1 -exp  (-beta*  )>"p5)> 
Shat2»exp(-((lambda;«i-exp(-betaio>)"PJ)) 
shat5 -exp (-( (lambdas- 1 -exp (-beta11)).. ps)) 
shat«-exp(-((lamb0as-i-exp(-beta12))--pjj) 
shats-exp(-((lam8das-i-exp(-beta15))"ps)) 
wri  teo0.81)  i.  shati.  shat2. shatj  .  shat*.  shats 
50  continue 

»1  FORMAT(lX.IJ.JX.F».*,JX.Ft.*.JX.F«.«.JX.Fi.«,JX.F«.«) 
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APPENDIX   D.       PROCEDURES    FOR   CONDUCTING   REGRESSION  ANALYSIS 


PROGRAM   REGrREP 


C  THIS   CODE    SEPARATES   A   PAVCRADE   DATA   SET   BY   HOS   GROUP    AMD  CREATES 

C  DATA   SETS   WEEDED   TO   CONDUCT    THE   SAS   LIPEREG   PROCEDURE.    AGAIN   THIS 

C  CODE    IS   DESIGNED   FOR   A   5   PIECE   WE1BULL.    THE    INPUT    PILE    («2> 

C  CONTAINS    THE   PAVGRADE   DATA    SET.      J   OUTPUT    DATA   SETS   ARE   CREATED   TO 

C  CONFORM   TO   THE    5    PIECE   WE1BULL.    THE   VARIABLE   N   IS   THE   NUMBER   OF 

C  FILES    IN   THT   PAVGRADE. 

INTEGER     I.N. RATE. MONTHS. CEN 
PARAHETER(N't<8) 
0rEN<r.F!LE"'PO8    DATA   A') 
DO    10    IM.N 

READ(2.»2)   RATE, MONTHS. CEN 

C  THIS    IS    THE   FIRST   PIECE   OF    THE   WEIB'JLL 

IF    (MONTHS     .LE.     10)    THEN 

IFIRATE    .EC.    520    .OR.    RATE    .EO.    5(0    .OR.    RATE    .EO.570    .OR. 
1  RATE    .EO.    SJO    .OR.    RATE    .EO.    S501THEN 

WRITE(2fr.»S)    MONTHS. CEN 
ELSE1FIRATE    .EO.    JtO    .OR.    RATE    .EO.    «20)    THEN 

WRITE(2t.»<)    MONTHS. CEN 
ELSEIFIRATE    .EO.    270    .OR.    RATE    .EQ.    240    .OR.    RATE    .EO.    180 
1  .PR.    RATE    .EO.    7*0    .OR.    RATE    .EO.     UO    .OR.    RATE    .EO. 

1  2B0)    THEN 

MRITECt.'S)    MONTHS. CEN 
ELSEIFIRATE    .EO.    B70    .OR.    RATE    .E3.    500    .OR.    RATE    .EO.    8<0)THEN 
WRITE(2t.»t>    MONTHS. CEN 
ELSE 

WR!TE(2».»7)    MONTHS. CEN 
END  IF 

C  THIS    IS    THE    SECOND   PIECE   OF    THE   WEIBULL. 

ELSEIFIMCNTHS    .GT.    10    .AND.    MONTHS    .LE.    200HHEN 

IFIRATE    .EO.    520    .OR.    RATE    .EO.    5«0    .OR.    RATE    .EO.570    .OH 
I  RATE    .EO.    SJO    .OR.    RATE    .EO.    SS01THEN 

MRITEI27.*J)   MONTHS. CEN 
ELSEIFIRATE    .EO.    JtO    .OR.    RATE    .EO.    <20>    THEN 

MRITE(27.»4)   MONTHS. CEN 
ELSEIFIRATE    .EO.    270    .OR.    RATE    .EO.    240    .OR.    RATE    .EO.    180 
1  .OR.    RATE    .EO.    7*0    .OR.    RATE    .EO.    l«p    .OR.    RATE    .EO. 

I  280)    THEN 

WR|TE<27,«)   MONTHS. CEN 
ELSEIFIRATE    .EO.    870    .OR.    RATE    .EO.    500    .OR.    RATE    .EO.   8«0>THEN 
WR1TEI27.M)   MONTHS. CEN 
ELSE 

WRITE(27.*7)   MONTHS. CEN 
END  IF 


THIS    IS   THE   LAST   PIECE   OF    THE   MEIBUU.. 

ELSE 

IFIRATE    .EO.    520    .OR.    RATE    .EO.    540    .OR.    RATE    .EO.570    .OR. 
1  RATE    .EO.    550    .OR.    RATE    .EO.    SSOITHEN 

HRITEC8.*J)    MONTHS. CEN 
ELSEIFIRATE    .EO.    JtO    .OR.    RATE    .EO.    «20)    THEN 

WRITECB.m    MONTHS   CEN 
ELSEIFIRATE    .EO.    270    .OR.    RATE    .EO.    2«0    .OR.    RATE    .EO.    180 
1  .OR.    RATE    .EO.    7»0    .OR.    RATE    .EO.    ltO    .OR.    RATE    .EO. 

1  28  0)    THEN 

WRITE (28. f5)    MONTHS. CEN 
ELSEIFIRATE    .EO.    870    .OR.    RATE    .EO.    500    .OR.    RATE    .EO.    8<0)THEN 
WRITEI28.H)    MONTHS. CEN 
ELSE 

WRITEI28.V)    MONTHS. CEN 
END  IF 
ENDIF 
10   CONTINUE 

THIS    IS    THE   FORMAT    FOR    THE   READ/INPUT   STATEMENTS. 


•  2   FORMATUK.IS.lSX.lS.lX.il> 

THESE   FORMAT   STATEMENTS   CREATE   REGRESSION   DATA   SETS.    MONTHS    IS    THE 
DEPENDENT   VARIABLE    AND    IS   LOCATED    IN   THE   FIRST    COLUMN.    THE 
PRECEDING   COLUMNS   CONTAIN   A   0   OR    1    IF    THE    PERSON   FALLS    INTO   ONE 
OTF    THE   MOS   GROUPS.    ITHE   STH  MOS   GROUP    IS   ASSUMED   AND   DOES   NOT    SHO 
IN   THE   DATA   SET. 


»J  FORMAT  I IX. 1 5. IX. '1'. IX. '0'. IX.' 
»«  FORMAT  I IX. 1 5. IX. '0'  , IX.'1'.IX. 
«  FORMAT (IX. IS. IX. '0' . IX.'O'.IX. 
H  FORMATUX, IS. IX. '0*. IX.'O'.IX. 
»7  FORMAT (IX. IS. IX. '0'  .IX.'O'.IX. 
END 


0' 

.IX. 

•0' 

.IK. 11) 

o- 

.IX. 

•0' 

.IX. Ill 

r 

.IX. 

•o- 

.IX. 11) 

0' 

.IX. 

•r 

.IX. 11) 

0' 

,1X. 

•0' 

•IX. 11) 
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FILE:  LIFEREG  CODE     Al 


TITLE  'PEC-SESSION  OUTPUT  PAYGRADE  E-f  /«S»S  CODE  PERFORMS  A  PECRESSION 
ON  CENSORED  DATA.  GENERATES  REGRESSION  COEFFICIENTS.  SHAPE  AND  SCALE 
PARAMETERS.  OUTPUT  IS  VERV  SMALL.  •/  I 

DATA  POil 

INPUT  MONTHS  AVIATION  AOMIN  TEC*  SKILLED  CENi 

CARDSi 

i 

PROC   LIFEREG  OUTEST. MODEL i 

CLASS   AVIATION   ADMIN   TECH   SKILLED   /"SETS    THE   VARIABLES   AVIATION. ADMIN. 
TECM.    AND  SKILLED  AS  CATEGORICAL    (0   OR    1  )    VARIABLES    •/    I 

MODEL   MOVTHS'CENC  DEVIATION  ADMIN   TECM   SKILLED/DIST.UEIBULL    /"SETS   THE 
DEPENDENT   VARIABLE   EOUAL    TO  MONTH,    SETS    THE   CENSORING   VAR IABLE -CENt 1 . 
SETS   THE    INDEPENDENT    VARIABLES  EOUAL   TO   AVIATION.    ADMIN.    TECH.    SKILLED. 
ESTABLISHES   THE   WEIBULL   AS   THE   PARAMETRIC   DISTRIBUTION   ■/      1 
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